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ABSTRACT 

When  testing  with  the  t-test,  it  is  assumed  that  the 

sample  under  investigation  is  from  a  norm.al  population.   The 

purpose  of  this  thesis  is  to  examine  the  sensitivity  of  the 

t-test  to  violations  of  this  norm.ality  assumption.   A  com.- 

puter  simulation  was  performed  to  draw  sets  of  10,000  samples 

from  an  infinite  V/eibull  population.   A  t-test  v;as  perform.ed 

on  each  sample  to  test  the  null  hypothesis  K  :  p  <_  y   where 

\i      was  the  true  mean  of  the  V/eibull  population.   The  number 

of  times  that  H  was  rejected  was  recorded  for  all  com.bina- 

o 

tions  of  eight  levels  of  significance,  samples  ranging  in 
size  from  2  to  31,  and  for  values  of  the  parameters  of  the 
distribution  X   =  1,2,3  and  3  =  1,2,3. 
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I.   INTRODUCTION 

A.   GENERAL 

In  statistical  experimentation  it  is  sometimes  desired 

to  test  the  assumption  that  the  mean,  y,  of  a  statistical 

population  is  in  some  v;ay  related  to  a  hypothesized  value 

y  .   To  evaluate  the  assumption,  a  null  hypothesis  H   is 

formulated  about  y ,  and  tested  against  an  alternative 

hypothesis,  H,  by  taking  a  sample  from  the  population  under 

X  -  y 

Investigation  and  forming  the  t-statistic  t  =  ^^  .   In 

_    -^   n  S//n 

this  transformation  x  =  —  Z  x,  is  the  sample  average, 

-,    n       2        ^~1 
S  =  — ^  Z  (x,  -x)   is  the  unbiased  sample  standard  deviation, 
n-1  k=i   ^ 

n  is  the  sample  size,  and  y   is  the  hypothesized  population 
mean. 

The  cumulative  t-distribution  has  been  tabled  for  various 
values  of  n,  and  levels  of  significance  a.   When  testing  the 
null  hypothesis  H  :  y  <_  y   against  the  alternative  hypothe- 
sis H:  y  >  y  ,  the  procedure  is  to  reject  H   if  the  calcu- 
lated t  is  greater  than  or  equal  to  the  tabled  (critical) 

t,   -,  N  / -,   N  for  n-1  degrees  of  freedom  at  the  1-a  conficence 
(n-l)(l-a)  ^ 

level  where  t,      -.  \  / -^       %  is  obtained  from  the  upper  tail  of 
(n-1 ) ( 1-a)  ^^ 

the  t-distribution.   Similarly,  if  the  null  hypothesis 

^o '  ^  ^  ^   ^^  tested  against  H:  y  <  y  ,  the  null  hypothesis 

is  rejected  if  t  <  -t/   -,  n  / -,   n  where  -t ,      -,  s  / -,       n  is 
"^  —        (n-l)(l-a)  (n-l)(l-a) 

obtained  from  the  lower  tail  of  the  t-distribution.   In  the 
case  of  a  two  trailed  test,  the  null  hypothesis  H  :  y  =  y^ 


is  tested  against  H :  u  7^  y   and  is  rejected  if 


t  <  -t 


(n-l)(l-a/2)  o^  if  t  >  t ^^_^^ ^^_^/^^     [Ref.  5]. 


B.   BACKGROUND  AND  PURPOSE 

When  testing  with  the  t-test  it  is  assumed  that  the 
sample  under  investigation  is  from  a  normal  population.   In 
general,  the  purpose  of  this  thesis  is  to  examine  the 
sensitivity  of  the  t-test  to  certain  violations  of  this 
normality  assumption. 

Some  previous  writers,  e.g.  Bartlett  [1]  have  investi- 
gated the  theoretical  distribution  of  the  t  statistic,  when 
sampling  from  an  infinite  non-normal  population.   Bartlett 
concludes  from  his  study  that  even  though  his  work  was 
Incomplete,  and  not  of  m.uch  quantitative  value,  it  does 
indicate  that  for  moderate  departures  from  normality  the 
t-test  may  still  be  used  with  confidence,  particularly  for 
testing  differences  in  means  of  equal  numbers  of  observations, 

In  a  different  approach,  Pearson  [6]  describes  how  he 
mechanically  drew  samiples  from  an  infinite  non-norm.al  popu- 
lation.  In  the  case  where  the  m.eans  of  only  tv;o  samples 
were  being  tested  for  equivalence,  the  value  of  t  was  cal- 
culated for  each  sample  to  empirically  obtain  some  idea  of 
the  frequency  distribution.   Using  a  chi  square  test  to  fit 
the  observed  t  to  a  theoretical  t  distribution  did  not 

appear  to  bring  out  any  system.atic  discrepancy.   Taken  as  a 

2 
whole  the  values  of  x   were  higher  than  should  be  expected 

if  the  variation  from  theory  v/as  solely  due  to  chance.   Also 


the  fits  on  the  whole  were  better  for  larger  size  samples. 
However,  Pearson  never  drew  more  than  1000  samples.   A 
greater  number  of  samples  is  needed  to  determ.ine  the  five 
percent  point,  and  more  so  the  one  percent  point,  since  the 
number  of  rejections  at  these  levels  is  so  lov/. 

Specifically,  this  thesis  examined  the  effect  of  samipling 
from  a  non-normal  population,  on  the  number  of  tim.es  the 
null  hypothesis  was  rejected  (given  that  the  null  hypothesis 
was  true)  when  testing  v/ith  the  t-test.   The  probability  of 
such  an  event  is  commonly  known  as  a  type  I  error.   If  the 
observed  number  of  rejections  obtained  v;hen  sam.pling  from,  a 
non-normal  distribution  is  near  the  expected  number  of 
rejections  that  should  be  obtained  by  sampling  from  a  normal 
distribution,  it  will  be  possible  to  use  the  t-table  as  if 
the  sample  had  come  from  a  normal  distribution.   However,  if 
the  observed  number  of  rejections  (when  samipling  from  a  non- 
normal  distribution)  is  significantly  different  from  the 
expected  number  of  rejections  that  should  have  been  obtained 
by  sampling  from  a  normal  distribution,  it  v;ill  be  necessary 
to  adjust  the  procedure  for  using  the  t-table  to  estim.ate  a 
critical  t  value, 

C.   V/EIBULL  DISTRIBUTION 

The  non-normal  distribution  of  interest  is  the  VJeibull 

6-1   — Xx^ 
distribution  with  distribution  function  f (x)=  X6x    e 

-VB  1 
The  expected  value  of  the  random  variable  X  is  X    r(-r-+l) 

p 

and  the  variance  is  X"^^^  |  r(  2/6  +  1)  -  [r(l/6+l)]^}  [Hef.3]. 


The  Welbull  distribution  frequently  appears  in  reliability 
theory  and  life  testing,  where  the  random  variable  X  repre- 
sents the  time  between  failures.   When  the  shape  parameter 
3=1  the  Welbull  distribution  reduces  to  the  exponential 
distribution  which  has  applications  in  queueing  theory  as 
well  as  reliability  theory  and  life  testing. 

The  Welbull  distribution  takes  on  a  variety  of  shapes, 

m 

depending  on  the  value  of  the  parameter  3.   The  spread  of 
the  distribution  is  determined  by  the  value  of  the  param.eter 
A.   One  might  therefore  expect  that  the  "t-statistic" 
obtained  by  sampling  from  a  Welbull  distribution  will  some- 
how depend  on  the  values  of  B  and  A.   If  this  is  true,  the 
number  of  rejections,  given  that  the  null  hypothesis  is 
true,  will  also  depend  on  the  parameter  values.   To  examine 
this  possibility,  combinations  of  A=l,2,3  with  3  =  1,2,3 
were  used  to  develop  9  distributions  from  the  family  of 
Welbull  distributions.   Tables  I  and  II  below  give  the 
resultant  means  and  variances  for  the  9  sets  of  parameter 
values . 

TABLE  I  TABLE  II 

Means  Variances 


A^     1       2       3        A^     1       2 


1  1.00  0.886        0.893  1        1.000        0.216        0.106? 

2  0.500        0.626        0.708  2        0.250        0.108        0.0664 

3  0.333        0.511        0.618  3        0.111        0.072        0.0511 


10 


Figure  1  shov/s  the  shape  of  the  V/elbull  distribution 
for  the  parameter  values  above. 
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II.   METHOD 

A.   GENERAL 

To  examine  the  "robustness"  of  the  Student  t-test  when 
sampling  from  a  Welbull  distribution,  a  computer  sim.ulation 
was  performed  to  repeatedly  draw  samples  of  size  1  from,  an 
infinite  V/eibull  population.   A  t-statistic  was  calculated 
for  each  sample  and  used  to  test  the  null  hypothesis  that 
the  sample  was  drawn  from  a  population  with  a  mean  equal  to 
or  less  than  y  .   To  ensure  that  the  hypothesis  was  in  fact 
true,  the  hypothesized  mean  y   was  set  equal  to  the  mean  y 
of  the  Welbull  population.   Each  time  that  the  null  hypothe- 
sis was  rejected,  the  result  was  recorded  for  the  level  of 
significance,  a.,  at  which  the  test  was  conducted,  and  the 
size  of  the  sample.   The  total  number  of  observed  rejections, 
r.  .  ,  was  computed  at  each  of  eight  different  levels  of 
significance  a.,  j=l,...,8  in  the  t-test,  and  for  samples  of 
size  1  =  2 , ... ,31. 

For  comparison,  and  to  assist  in  validating  the  com.puter 
program,  the  entire  experiment  was  repeated  with  sampling 
from  a  standard  normal  distribution.   As  with  the  VJeibull 
case,  computer  simulation  was  used  to  repeatedly  draw 
samples  of  size  i.   For  each  sample,  a  t-statistic  v;as 
calculated  and  used  to  test  the  null  hypothesis  that  the 
sample  was  drawn  from  a  population  with  a  mean  y  equal  to 
or  less  than  y  .   Once  again  the  hypothesized  mean  y   was 
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set  equal  to  the  population  mean  y  =  0 .   The  number  of 
rejections  was  recorded  as  In  the  Welbull  case. 

B.   SAMPLING  TECHNIQUE 

Using  the  random  number  generator  RANDU  provided  for 
Fortran  IV  with  the  IBM  System  /36O  Source  Library,  uniform 
random  varlates  were  generated  on  the  Interval  (0,1). 
J.  N.  Bramhall  [2]  In  a  report  that  discusses  a  comparison 
of  three  uniform  random  number  generators  for  the  IBM  36O, 
fitted  RANDU  to  a  uniform  (0,1)  distribution  with  a  Chi 
Square  test  at  the  95%    confidence  level. 

The  uniform  random  varlates  obtained  from  RANDU  were 
subsequently  used  to  produce  Welbull  random  varlates  by 
the  Inverse  transformation  method  [h] .      Since  the  cumulative 
frequency  distribution  F(x)  ranges  over  the  Interval  (0,1), 
the  uniform  (0,1)  random  varlates,  V,  that  were  generated 
from  RANDU  were  set  equal  to  F(x).   Solving  the  resultant 
equation  for  x  produces  random  varlates  with  the  distribu- 
tion function  desired. 

-Xx^ 

In  the  case  of  the  Welbull  distribution  F(x)  =  1  -e 

Setting  P(x)  =  V  and  solving  for  x  yields 

F(x)  =  V  =  1  -  e  ^^ 

e"^^   =  1  -  V 

-Ax^ 
iln  e     =  In    (1-V) 

Since  the  distribution  of  V  is  symmetric 
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-Ax^  =  «n  V 


x^ 


X  = 


Here,  V  represents  uniform  random  variates  on  the  interval 
(0,1),  and  A  and  6  are  again  the  parameters  of  the  V/eibull 
distribution . 

Normal  random  variates  were  obtained  using  the  sub- 
routine GAUSS  provided  by  the  IBM  System/360  Source  Library, 
The  Central  Lim.it  Theorem  states  that  the  probability 
distribution  of  the  sum  of  n  independent  and  identically 

distributed  random  variables  X.,  with  mean  y,  and  variance 

1 '  i 

2 

a.   approaches  asymptotically  a  normal  distribution  v;ith 

mean  y  and  variance  a  ,  where  y  =   2  y .  and  a   =   Z  a.  . 

1=1  ^  1=1  ^ 

Subroutine  GAUSS  calls  RANDU  to  produce  n  uniform  random. 

uniform  random  variates  V.,  on  the  interval  (0,1).   The 

n 
expected  value  of  the  sum  E(  Z  V.  )  -   -^,    and  the  variance  of 

n  1=1 

the  sum  Var(  Z  V.  )  =  r-^  .  Making  the  transformation 

I   V.-n/2 

Z  =  — ""      =  —^i;;^^ —  yields  a  standard  norm.al  random. 

vATOO       •n7T2 
variate.   A  normal  random  variate  X  with  any  desired  mean 

2 

y   and  variance  a    is  obtained  from  X  =  a  Z  +  y  .   Sub- 

X  X  XX 

routine  GAUSS  sets  n  at  12,  which  eliminates  the  radical 
in  Z  and  speeds  up  the  computation. 


Ik 


C.   TESTING  PROCEDURE 

For  each  sample,  the  sample  average  and  standard  devia- 
tion were  calculated.   A  t-test  was  then  conducted  on  each 

sample  to  test  the  null  hypothesis  H  :  y  <  y   where  y   was 
^  "^  *  o    —  o        o 

the  true  m.ean  of  the  population  from  which  the  sariple  v;as 
drawn.   The  calculated  value  of  t  v;as  then  compared  v;ith 
the  critical  (tabled)  t  for  the  appropriate  sample  size 
i  =  2,,, .,31,  and  significance  level  a.,  j  =  1,...,8.   If 
the  calculated  t  was  equal  to  or  greater  than  the  critical 
t  the  null  hypothesis  was  rejected.   The  num.ber  of  rejec- 
tions r.  .  was  recorded  for  each  level  of  significance  a. 
at  which  the  null  hypothesis  was  tested,  and  for  sam.ples  of 
size  i  =  2  , . . . , 31 . 
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III.   METHOD  OF  ANALYSIS 

Once  the  number  of  rejections  r. .  was  determined  (given 
that  the  null  hypothesis  was  true)  for  each  level  of  sig- 
nificance a.,  j  =  1,...,8,  and  for  all  sample  sizes 
i  =  2,..., 31,  it  was  necessary  to  determine  if  the  number  of 
observed  rejections  were  significantly  different  from  the 
expected  number  of  rejections  e..   The  expected  num.ber  of 
rejections,  assuming  a  normal  population,  was  obtained  by 
taking  the  product  of  the  probability  of  a  type  I  error,  a 
(i.e.,  the  probability  of  rejecting  the  null  hypothesis  when 
in  fact  the  null  hypothesis  is  true),  and  the  number  of 
times  the  test  was  repeated  with  a  different  sample.   For 
each  size  sample,  the  null  hypothesis  y  <_  y   was  tested  for 
10,000  different  samples  at  the  a.  level  of  significance. 
The  values  of  a.  that  were  used,  and  the  resultant  expected 

number  of  rejection  e.  is  shown  below  in  Table  III. 

J 

TABLE    III 

j  2345678 

a.  .25         .20         .15         .10       .05       .025       .005       .0005 

J 

e.  =  (10,000)a.   2500   2000   1500   1000   500    250     50       5 


To  determine  if  the  observed  number  of  rejections  was 
significantly  different  from  the  expected  number  of  rejec- 
tions, a  Chi  Square  test  with  one  degree  of  freedom  was 
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conducted  to  evaluate  the  null  hypothesis  H   :  r. .  =  e . 
for  each  value  of  r. . .   Table  IV  is  the  contingency  table 
for  the  Chi  Square  test. 

TABLE  IV 

Number  of  times    Number  of  tim.es 


Expected  number 
Observed  number 


H   accepted 
o 

H 
o 

rejected 

Total 

10000-e. 
J 

"J 

10000 

10000-r^. 

"ij 

10000 

The  Chi  Square  statistic  was  obtained  by  calculating 

2    (ei  -  ^il)^    (^il-en)^  2 

X   =   innnn  "^  — *  ^^^^^   the  calculated  x   >^'as 

greater  than  the  critical  x   with  one  degree  of  freedom 
at  the  1  -  a  confidence  level  the  null  hypothesis  H   :  r.  .  =  e. 
was  rejected.   An  example  of  the  method  of  analysis  is 
given  in  Section  VII. 
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IV.   OUTPUT 


The  output  obtained  from  the  computer  simulation  was  r. ., 

the  observed  number  of  times  that  the  null  hypothesis 
H  :  y  <_  y   was  rejected,  when  in  fact  the  null  hypothesis 
was  true.   As  previously  indicated  r. .  was  obtained  for 
J  =  1,...,8  levels  of  significance  in  the  t-test,  and  for 
samples  ranging  in  size  from  i  =  2,...,  31.   For  each  r.., 
the  probability  of  a  type  I  error,  y. .  was  obtained  by 
taking  the  ratio  of  the  number  of  observed  rejections  to 

-t   •   « 

the  number  of  samples  M  =  10,000  tested,  e.g.  ,  y.  .  =  t^ttt^tt 

'^  '  J     G  5   1  -Lj    10000 

Appendices  A-J  table  the  values  of  y. .  for  each  of  the 
ten  cases  examined  (sampling  from  a  standard  normal  distri- 
bution, and  sampling  from  a  V/eibull  distribution  with  nine 
sets  of  parameter  values).   The  values  of  the  index 
1  =  2,..,,  31  again  represent  the  sam.ple  sizes  used,  and 
j  =  1,...,8  reference  the  levels  of  significance  for  which 
each  sample  was  tested  (see  Table  III).   Probabilities  that 
appear  with  an  asterisk,  i.e.  y..  represent  a  situation 
where  the  number  of  observed  rejections  r. .  was  significant- 
ly  different  from  the  expected  number  of  rejections  e.  at 

the  ,01  level  of  significance.   Probabilities  appearing  v;ith 

/ 
a  check,  i.e.  y..  represent  the  same  situation  at  the  .05 

level  of  significance. 
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V.   RESULTS 

A.  NORMAL  CASE 

In  review,  for  the  normal  case  samples  were  drawn  from 
a  standard  normal  distribution.   A  one  tailed  t-test  was 
conducted  on  each  of  10,000  different  samples  at  eight 
levels  of  significance  to  test  the  null  hypothesis 
H  :  y  <_  y  .   The  process  was  repeated  for  samples  ranging 
in  size  from  2  to  31. 

As  anticipated  the  observed  number  of  rejections  r. . 

did  nearly  equal  the  expected  number  of  rejections  e.  in 

all  cases.   In  fact  the  null  hypothesis  H   :  r. .  =  e.  was 

accepted  for  all  r. .  at  the  .01  level  of  significance  and 

ij 

rejected  for  only  eight  of  the  2^0  r. .  at  the  .05  level  of 
significance.   The  rejections  that  did  occur  v;ere  attributed 
to  the  stochastic  nature  of  the  testing  procedure. 

Appendix  A  tables  the  results  for  the  normal  case. 

B.  WEIBULL  CASE 

The  results  of  the  simulation  changed  significantly 

when  samples  were  drawn  from  a  Weibull  distribution. 

In  general,  for  a  given  value  of  the  parameter  B,  and 

for  fixed  i  and  j,  the  observed  number  of  rejections  r . . 

-'-  J 

was  relatively  insensitive  to  changes  in  the  parameter  X. 
However,  for   a  given  A ,  as  3  increased,  the  r. .  increased 
causing  a  decrease  in  the  number  of  times  that  H   :  r. .=e. 
was  rejected.   In  the  three  cases  where  6  =  Ij  with  the 
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exception  of  testing  H  :  y  <_  y   at  the  .0005  level, 

H   :  r. .  =  e.  was  rejected  for  all  r. .  at  the  .01  level. 

When  testing  H  :  y  <  y   at  the  .0005  level,  H   :  r. .  =  e. 

was  rejected  for  most  of  the  r..  at  the  .05  level.   Other- 

wise  H   :  r..  =  e.  was  accepted.   The  results  were  essen- 
o    ij     j         ^ 

tially  the  same  for  the  three  cases  where  6=2  with  only 
a  slight  decrease  in  the  total  number  of  times  H   :  r. . =  e. 

o      IJ     J 

was  rejected.   In  the  three  cases  where  3=3,  the  results 
were  closest  to  the  results  expected  if  sampling  had  been 
from  a  normal  distribution.   In  fact,  for  A  =  3  the  null 
hypothesis  H   :  r.  .  =  e.  was  accepted  for  206  of  the  2^0  r... 

With  few  exceptions,  the  values  of  r. .  tended  to 
increase  as  the  sample  size  increased.   This  increasing 
trend  was  difficult  to  detect  for  small  values  of  a, 
possibly  because  the  increase  was  masked  by  the  random 
fluctuations  in  r. . ,  where  the  r. .  were  already  small. 

A  final  result  of  the  experiment  was  that  the  r. .  were 
usually  less  than  the  expected  number  of  rejections.   In 
fact,  the  stronger  hypothesis  H  ':  r. .  <  e.  was  accepted  at 
the  .95  level  of  confidence  in  all  cases  for  all  r... 
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VI.   CONCLUSIONS 

The  results  of  the  experirient  suggest  that  the  validity 
of  the  t-test  is  sensitive  to  the  assuniption  of  normality 
if  sampling  is  done  from  a  V/elbull  distribution  with  the 
parameter  values  chosen  in  this  paper.   Accepting  the  null 
hypothesis  H  ':  r. .  <  e.  for  any  sample  size  and  level  of 
significance  implies  that  the  probability  of  rejecting  a 
true  hypothesis  is  less  when  sampling  from  a  V/eibull  distri- 
bution than  when  sampling  from  a  normal  distribution.   This 
will  tend  to  cause  the  experimenter  to  announce  too  fev; 
significant  results  if  the  t-table  is  used  as  if  sampling 
from  a  normal  distribution.   However,  since  the  probability 
of  making  a  false  rejection  y..,  v;hen  actually  testing  at 
the  a  level  of  significance  has  nov;  been  determined  for  the 
Weibull  case,  the  problem  of  too  few  significant  results 
can  be  overcome  for  any  sample  size  by  finding  the  critical 
t  value  corresponding  to  the  desired  level  of  significance 
y...   This  procedure  will  be  dem^onstrated  in  an  example  in 
the  next  section. 

In  order  to  obtain  a  better  estim.ate  o-f  the  probability 
of  rejecting  a  true  hypothesis  at  the  .0005  level  of 
significance,  more  samples  are  needed.   At  this  level  of 
significance  with  10,000  sam.ples,  the  expected  number  of 
rejections  is  only  five.   The  ar:Ount  of  deviation  from,  the 
expected  number  of  rejections  was  such  that  no  rejections 
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frequently  occurred  in  10,000  samples.   This  implies  that 
the  probability  of  rejecting  a  true  hypothesis  is  zero 
when  testing  at  the  .0005  level.   However,  based  on  the 
hypothesis  that  the  observed  number  of  rejections  is  equal 
to  or  less  than  the  expected  number  of  rejections,  the 
probability  of  rejecting  a  true  hypothesis  when  testing 
at  the  .0005  level  is  bounded  between  zero  and  .0005. 

The  fact  that  the  r..  increased  as  the  sample  size 
Increased  is  supported  by  the  Central  Limit  theorem.   For 
large  samples,  the  "pseudo  t-distribution"  formed  by 
sampling  from  a  Weibull  distribution  asymptotically 
approaches  a  normal  distribution.   Since  this  is  also  true 
of  a  "real  t-distribution"  where  sampling  is  from  a  normal 
distribution,  the  observed  number  of  rejections  obtained  by 
sampling  from  a  Weibull  distribution  will  approach  the 
expected  number  of  rejections  for  increasing  sample  sizes. 
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VII.   EXAMPLE 

To  illustrate  the  method  of  analysis,  and  to  demonstrate 
a  procedure  for  using  a  t-table  to  estim.ate  a  critical  t 
value  when  sam.pling  from  a  Weibull  distribution  consider 
the  following  example. 

From  Appendix  B  the  value  yj.^   =  .0100  implies  that  for 
samples  of  size  four,  and  testing  at  the  .05  level  of 
significance  with  a  one  tailed  t-test,  that  the  probability 
of  a  type  one  error  is  estimated  to  actually  be  .0100  when 
sampling  is  from  a  Weibull  distribution.   If  y^^   =  .0100 
then  r^ic-  =  100  which  implies  100  observed  rejections  of  the 
niAll  hypothesis  Hq  :  y <_1 . 0 .  Pilling  in  Table  IV  gives  the  re- 
sults below. 

Number  of  times    Number  of  times 

H   accepted       H   rejected     Total 
o     ^  o    "^ 

Expected  number        9500  500         10000 

Observed  num.ber        9900  100         10000 


wv.      nv.'    Q  +.  ^ .  ^  .   K         2   (500-100)^  ^   (500-100)^ 

The  Chi  Square  statistic  becomes  y  =    ^rr^n +  nmn - 

500         9500 

336.   Since  336  is  larger  than  the  critical  Chi  Square  with 
one  degree  of  freedom  at  either  the  .05  or  .01  level  of  sig- 
nificance,  the  null  hypothesis  H   :  r^^   =    500  is  rejected. 
It  can  then  be  concluded  that  when  sampling  from  a  V/eibull 
distribution  with  X  =  1,  3  =  1  and  testing  with  a  one  tailed 
t-test  at  the  .05  level  of  significance  for  samples  of  size 
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four,  that  the  observed  number  of  rejections  In  significantly 
different  from  the  expected  number  of  rejections  had  the 
sample  been  from  a  normal  distribution.   Consequently,  the 
probability  of  a  type  I  error,  when  sampling  from,  the  above 
Welbull  distribution,  is  significantly  different  from  the 
expected  probability  of  a  type  I  error  when  sampling  from,  a 
normal  distribution.   In  fact,  since  the  stronger  hypothesis 
K  ':  Tuj-    <_   500  can  be  accepted,  the  probability  of  a  type 
I  error  under  the  above  conditions  is  less  than  the  prob- 
ability of  a  type  I  error  when  sampling  from  a  norm.al  dis- 
tribution.  As  previously  mentioned,  this  will  cause  the 
experimenter  to  announce  too  few  significant  results  if  a 
t-table  is  used  as  if  sampling  from  a  normal  distribution. 
The  experimenter  is  now  faced  with  the  problem  of  determin- 
ing a  critical  value  that  corresponds  to  the  probability  of 
a  type  I  error  for  samples  from  a  Weibull  distribution. 
Returning  to  Appendix  B  to  test  the  null  hypothesis 

H  :  y  <  y   for  samples  of  size  four  at  the  .05  level  of 
o    —  o        ^ 

significance,  where  it  is  known  that  sampling  is  from 
Weibull  distribution,  it  is  observed  that  Ykc  =  0.05  falls 
between  the  a  =  0.15  and  a  =  0.10  columns.   By  entering  a 
t-table  at  either  the  a  =  0.15  or  a   =  0.10  level,  for 
samples  of  size  four,  the  experimenter  will  obtain  an 
estimate  of  the  critical  value  that  corresponds  to  testing 
at  the  0.05  level  of  significance  for  sam.ples  from  a 
Weibull  distribution.   Choosing  a   =  0.15  will  result  in  a 
larger  critical  value  and  a  more  conservative  test. 
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VIII.   EXTENSIONS 

The  choice  of  the  Weibull  distribution  was  mostly  arbi- 
trary, even  though  reference  was  made  to  its  application  in 
reliability  theory  and  life  testing.   The  possibilities  for 
extending  this  investigation  to  other  non-normal  distribu- 
tions are  numerous.   In  addition  to  the  common  distributions 
with  known  distribution  functions,  bimodal  and  truncated 
distributions  warrent  investigation.   It  would  also  be 
interesting  to  examine  the  robustness  of  the  t-test  when 
sampling  from  non-normal  distributions  when  two  means  are 
being  compared.   Also  of  interest  would  be  the  case  where 
the  two  samples  are  from  different  non-normal  distributions. 
As  indicated,  the  possibilities  for  extensions  are  numerous 
and  limited  only  by  the  experimenters  time,  interest,  and 
needs . 
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